Arabic phonemes transcription using data driven approach

نویسندگان

  • Khalid M. O Nahar
  • Husni Al-Muhtaseb
  • Wasfi G. Al-Khatib
  • Moustafa Elshafei
  • Mansour Al-Ghamdi
چکیده

The efficiency and correctness of continuous Arabic Speech Recognition Systems (ARS) hinge on the accuracy of the language phoneme set. The main goal of this research is to recognize and transcribe Arabic phonemes using a data-driven approach. We used the Hidden Markov Toolkit (HTK) to develop a phoneme recognizer, carrying out several experiments with different parameters, such as varying number of Hidden Markov Model (HMM) states and Gaussian mixtures to model the Arabic phonemes and find the best configuration. We used a corpus consisting of about 4000 files, representing 5 recorded hours of Modern Standard Arabic (MSA) of TV-News. A statistical analysis for the phonemes length, frequency and mode was carried out, in order to determine the best number of states necessary to represent each phoneme. Phoneme recognition accuracy of 56.79% was reached without using a language model. The recognition accuracy increased to 96.3% upon using a bigram language model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lithuanian Speech Recognition Using the English Recognizer

The present work is concerned with speech recognition using a small or medium size vocabulary. The possibility to use the English speech recognizer for the recognition of Lithuanian was investigated. Two methods were used to deal with such problems: the expert-driven (knowledgebased) method and the data-driven one. Phonological systems of English and Lithuanian were compared on the basis of the...

متن کامل

Phonetization of Arabic: rules and algorithms

One approach to the transcription of written text into sounds (phonetization) is to use a set of welldefined language-dependent rules, which are in most situations augmented by a dictionary of exceptional words that constitute their on rules. The process of transcribing into sounds starts by pre-processing the text into lexical items to which the rules are applicable. The rules can be segregate...

متن کامل

Automatic Phonetization-based Statistical Linguistic Study of Standard Arabic

Statistical studies based on automatic phonetic transcription of Standard Arabic texts are rare, and even though studies have been performed, they have been done only on one level – phoneme or syllable – and the results cannot be generalized on the language as a whole. In this paper we automatically derived accurate statistical information about phonemes, allophones, syllables, and allosyllable...

متن کامل

Difficulties of Standard Arabic Phonemes Spoken by Non - Arab Primary School Children based on Formant Frequencies

Problem statement: The study of Malaysian Arabic phoneme is rarely found which make the references work difficult. Specific guideline on Malaysian subject is not found even though a lot of acoustic and phonetics research has been done on other languages such as English, French and Chinese. Approach: This study discussed about the correct and simplest way of Arabic phonemes pronunciation in Mala...

متن کامل

Data driven multidialectal phone set for Spanish dialects

This paper addresses the use of a data-driven approach to determine a multidialectal phone set for an automatic speech recognition system for Spanish dialects. This approach is based on a decision tree clustering algorithm that tries to cluster contextual units of different dialects. This procedure avoids the definition of a global phonetic inventory and the previous study of similarity of soun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Int. Arab J. Inf. Technol.

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2015